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File: USPT 



Oct 23, 1990 



US-PAT-NO: 4965763 

DOCUMENT- IDENTIFIER: US 4965763 A 

TITLE: Computer method for automatic extraction of commonly specified information from 
business correspondence 

DATE-ISSUED: October 23, 1990 



INVENTOR- INFORMATION : 
NAME 

Zamora; Elena M. 



CITY 

Chevy Chase 



STATE 
MD 



ZIP CODE 



COUNTRY 



ASS IGNEE - INFORMATION : 
NAME 



CITY STATE ZIP CODE COUNTRY TYPE CODE 



International Business Machines Corporation Armonk NY 



02 



APPL-NO: 7/ 308955 [PALM] 
DATE FILED: February 6, 1989 

PARENT -CASE: 

This is a continuation of U.S. patent application Ser. No. 021,078, filed Mar. 3, 1987, 
now abandoned. 

INT-CL: [5] G06F 15/40 

US-CL-ISSUED: 364/900; 364/918, 364/963, 364/963.1, 364/419 
US-CL -CURRENT: 704/1 
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OTHER PUBLICATIONS 

G. E. Heidorn, et al . , "The Epistle Text -Critiquing System," IBM Systems Journal, vol. 21, 
No. 3, pp. 305-236 (1982) . 

H. Tennant, et al . , "Menu-Based Natural Language Understanding," Proc . 21st Ann. Mtg. 
Assoc. for Computational Linguistics, pp. 151-158, Jun. 1983. 

M. S. Palmer, et al . , "Recovering Implicit Information, " Proc. of 24th Annual Meeting of 
the Assoc. for Computational Linguistics, pp. 10-19, Jun. 1986. 

E. M. Zamora, et al . , "Extraction of Chemical Reaction Information from Primary Journal 
Text Using Computational Linguistics Techniques, ..." J. Chem. Inf. Comput . Sci., 24, 



lof 



jpp. 176-181, 1984, 
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E. M. Zamora, et al . , "Extr^Riion of Chemical Reaction InforWtion" from Primary Journal 
Text Using Computation Linguistics Techniques, J. Chem. Inf. Comput . Sci . , 24, pp. 
181-188, (1984) . 

ART-UNIT: 237 

PRIMARY -EXAMINER: Chan; Eddie P. 
ATTY -AGENT -FIRM: Hoel; John E. 



ABSTRACT: 

A Parametric Information Extraction (PIE) system has been developed to identify 
automatically commonly specified information such as author, date, recipient, address, 
subject statement, etc. from documents in free format . The program-generated data can be 
used directly or can be supplemented manually to provide automatic indexing or indexing 
aid, respectively. 
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File: USPT 



Apr 3, 1990 



US -PAT-NO : 4914590 

DOCUMENT- IDENTIFIER: US 4 914590 A 

TITLE: Natural language understanding system 

DATE-ISSUED: April 3, 1990 



INVENTOR- INFORMATION : 
NAME 

Loatman; Robert B. 
Post; Stephen D. 
Yang ; Ch i h - Ki ng 
Hermansen; John C. 



CITY 

Vienna 

McLean 

Rockville 

Catharpin 



STATE 

VA 

VA 

MD 

VA 



ZIP CODE 



COUNTRY 



ASS IGNEE - INFORMATION : 
NAME 

Emhart Industries , Inc . 



CITY STATE 
Indianapolis IN 



ZIP CODE 



COUNTRY 



TYPE CODE 
02 



APPL-NO: 7/ 195237 [PALM] 
DATE FILED: May 18, 1988 

INT-CL: [4] G06F 15/21, G06F 15/18, G06K 9/62 

US-CL-ISSUED: 364/419; 364/200, 364/274.8 
US -CL- CURRENT: 704 /8 

FIELD -OF -SEARCH : 364/200, 364/900, 364/274.8, 364/419, 364/917.92 
PRIOR-ART-DISCLOSED : 



OTHER PUBLICATIONS 

Loatman, R. B. & McCown, M. G. , Information Extraction from Natural Language Messages, 
Reprint from Proceedings of ESIG- -Third Annual Expert Systems in Government Conference, 
Oct. 19-23, 1987. 

Loatman, R. B., A Hybrid Architecture for Natural Language Understanding, Reprint from 
Proceedings of SPIE- -International Society of Optical Engineering, May 18-20, 1987. 
R. B. Loatman article, Natural Language Text Understanding, at pp. 2-5 of Jul. 1986, 
Technology Newsletter (Assignee Internal Publication) . 

Undated Brochure of Assignee, "Innovative Solution for AMHS " , (author unknown). 
M. Bates, 1987, "The Theory & Practice of Augmented Transition Network Grammers", in L. 
Bole (ed.) Natural Language Communication with Computers, New York, Springer. 
Winograd, T., 1983, Language as a Cognitive Process, vol. 1: Syntax, Reading, Mass: 
Addison-Wesley, (Chapters, Appendix D) , 195-271, 583-599. 

Quirk, R., Greenbaum, S., Leech, G. , & Svartvik, J., 1985, A Comprehensive Grammer of the 
English Language, New York: Seminar Press (excerpts from Chapter 10, Appendix I) . 
Sager, N. , 1981, Natural Language Information Processing: A Computer Grammar of English 
and Its Applications, Reading, Mass.: Addison-Wesley (excerpts from Appendix I). 
Wilks, V., Haung, X., & Fass, D., 1985, "Syntax, Preference, and Right Attachment", 
Proceedings of the Ninth I JCAI . 

Loatman, R. B., 1988, "Natural Language Text Understanding", Article to be published in 
assignee Newsletter, (cf. reference AT). 

Dept. of Navy, NOSC, May 21, 1987, Memo Concerning Conference Later Held (at which various 
NLU Systems were eventually demonstrated) . 

Schank, R., 1985, Conceptual Information Processing, New York: North-Holland, (excerpts 
from Chapter 3) . 

Cook, W., 1979, Case Grammer: Development of the Matrix Model, Washington, D.C.: 
1 of Georgetown University Press, (excerpts). 6/6/02 4:04PM 
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Laffal, J., 1973, A Concepl^Jictionary of English, Essex, Coin.: Gallery Press, 
(excerpts) . 

Marcus, M . , 1980, Theory of Syntactic Recognition for Natural Language, Cambridge, Mass.: 
MIT Press, (excerpts) . 

ART-UNIT: 236 

PRIMARY -EXAMINER: Jablon; Clark A. 
ATTY- AGENT- FIRM: Forest; Carl A. 



A hybrid natural language understanding (NLU) system which is particularly designed for 
processing natural language text. Primary functional components of the NLU system include 
a preprocessor; a word look-up and morphology module which communicates with a lexicon and 
a learning module; a syntactic parser which interfaces with an augmented transition 
network (ATN) grammar; a case frame applier, which converts the syntactic structure into 
canonical, semantic "case frames"; and a discourse analysis component which integrates 
explicit and implied information in the text into a conceptual structure which represents 
its meaning. This structure may be passed on to a knowledge based system, data base, to 
interested analysts or' decision makers, etc. Significant feedback points are provided, 
e.g., the case frame applier may notify the syntactic parser of a semantically incorrect 
parse, or the syntactic parser may seek a semantic judgment based on a fragmentary parse. 
This system incorporates a novel semantic analysis approach based largely on case grammar. 

65 Claims, 57 Drawing figures 
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File: USPT 



Apr 3, 1990 



DOCUMENT -IDENTIFIER: US 4914590 A 

TITLE: Natural language understanding system 



Abstract Paragraph Left (1) : 

A hybrid natural language understanding (NLU) system which is particularly designed for 
processing natural language text. Primary functional components of the NLU system include 
a preprocessor; a word look-up and morphology module which communicates with a lexicon and 
a learning module; a syntactic parser which interfaces with an augmented transition 
network (ATN) grammar; a case frame applier, which converts the syntactic structure into 
canonical, semantic "case frames"; and a discourse analysis component which integrates 
explicit and implied information in the text into a conceptual structure which represents 
its meaning. This structure may be passed on to a knowledge based system, data base, to 
interested analysts or decision makers, etc. Significant feedback points are provided, 
e.g., the case frame applier may notify the syntactic parser of a semantically incorrect 
parse, or the syntactic parser may seek a semantic judgment based on a fragmentary parse. 
This system incorporates a novel semantic analysis approach based largely on case grammar. 



Brief Summary Paragraph Right (4) : 

However, there is another NLU application that is less publicized but much more important. 
Even if the information in a data base is readily accessible, how accurate and timely is 
that information For example, in message processing applications, many messages arrive at 
an intelligence center in an unformatted, "free text" form (i.e., natural language). No 
present NLU system can account for all of English, and in order to accomplish any useful 
work with such a system, it is built with a specific, limited task in mind. The linguistic 
structures and vocabulary that a system can handle are specifically targeted to an 
application domain and expected text input format. A special use of language peculiar to a 
domain is often referred to as a "sublanguage", a term encompassing dialects and jargons. 
A significant part of an NLU developer's job is to discover the characteristics of a 
sublanguage and specify them for the requirements of an NLU development system. 

Detailed Description Paragraph Right (123) : 

PARTUS includes data bases of information about words (lexical information) and concepts 
( semantic information) . This section deals primarily with the lexical component. The 
conceptual component is mentioned in a few places where it interfaces with the lexical 
one. It is fully discussed below at Section 5. Section 4.1 defines the current PAKTUS 
lexical categories and their associated features. The overall structure and operation of 
the lexicon is explained in section 4.2. Finally, section 4.3 explains how the lexicon is 
extended. 

Detailed Description Paragraph Right (143) : 

PAKTUS recognizes a prescribed set of "affixes"" (prefixes and suffixes) and knows how 
they ordinarily affect the lexical and semantic content of words to which they are added. 
It also has rules specifying how they alter the spelling when combined with a root (e.g., 
when adding -ly to a word ending in "y" first change the "y" to "i" as in happy 
. fwdarw. happily) . A subset of these (-s, -'s, -er -est, -ly, -ed, -ing, and -th, along 
with their spelling variants) have completely predictable effects when added to certain 
word classes (e.g., -er always makes an adjective comparative, although it is less certain 
what it does to a verb) For this subset, PAKTUS derives the information about the 
inflected form when it is first encountered and keeps it in virtual memory. It does not 
ask the dictionary officer for confirmation of the derivation since there is no doubt, nor 
does it write the derived form permanently in the lexicon data base. This saves space with 
little or no cost in processing time. 

Current US Original Classification (1) : 
704/8 
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□ 1. Document ID: US 20020010714 Al 

L22: Entry 1 of 3 File: PGPB Jan 24, 2002 

PGPUB - DOCUMENT - NUMBER : 20020010714 
PGPUB - F I L ING - TYPE : new 

DOCUMENT- IDENTIFIER: US 20020010714 Al 

TITLE: Method and apparatus for processing free-format data 
PUBLICATION-DATE: January 24, 2002 
INVENTOR- INFORMATION : 

NAME CITY STATE COUNTRY RULE -4 7 

Hetherington, Greg Kareela AU 

US -CL- CURRENT: 707/505; 707/508 



□ 2. Document ID: US 6272495 Bl 

L22: Entry 2 of 3 File: USPT Aug 7, 2001 

US-PAT-NO: 6272495 

DOCUMENT- IDENTIFIER: US 6272495 Bl 

TITLE: Method and apparatus for processing free-format data 
DATE-ISSUED: August 7, 2001 

INVENTOR- INFORMATION : 

NAME CITY STATE ZIP CODE COUNTRY 

Hetherington; Greg Kareela New South Wales 2232 AUX 



US -CL- CURRENT: 707/101; 707/102, 707/4, 707/531 



ieview Classification 



□ 3. Document ID: US 2002010714 Al WO 9848360 Al AU 9870147 A EP 1078323 Al US 
6272495 Bl CN 1315020 A 

L22: Entry 3 of 3 File: DWPI Jan 24, 2002 

DERWENT-ACC-NO: 1998-583882 
DERWENT-WEEK: 2 00210 

COPYRIGHT 2 002 DERWENT INFORMATION LTD 

TITLE: Free format data processing method e.g. for computer - processing free format data 
to produce text object associated with free format data with text object has several 
component nodes containing attribute type identifiers for elements of free format text and 
other data 

INVENTOR: HETHERINGTON, G 
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